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'nI" . Introduction 

Curve registration 

0\\ Our concern is the statistical problem of curve registration, which appears nat- 

■ urally in a large number of applications, when the available data consist of a 
set of noisy, distorted signals that possess a common structure or pattern. This 
pattern constitutes the essential information that we want to dig out from the 
observations. However, the deformations of the signals are generally nonlinear 

■ and relatively complex, which complicates the statistical task. Fortunately it is 
^ I relevant in some cases to assume that the signals only differ from each other by 

a horizontal shift: we call this modeling the shifted curve model. For instance, 
it was successfully adopted for the interpretation of the ElectroCardioGramms: 
each deflection is considered as a repetition of the same signal starting at a 
random time. Isserles et al. [28] proposed an estimator of the common pattern. 
Interestingly, the assumptions on the deformations are in practice violated due 
to the baseline wandering, a periodic vertical perturbation of the potential, but 
the estimation of the structural pattern performs well yet. 

By contrast, SIFT descriptors (c/. Lowe [31]) in computer vision are an ex- 
ample where the specification of the deformations is essential: selected keypoints 
of an image are assigned with descriptors including a histogram of the local gra- 
dient. If the image is rotated, the histogram of each keypoint is simply shifted 
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by the angle of the rotation. To match the keypoints of the two images, it is 
then sufficient to test the adequation of their histograms with the shifted curve 
model. So, testing the model is sometimes the main concern, and even when 
estimation matters, the adequation of the model may have to be tested, as the 
estimation techniques depend on the structure of the deformations. 

We refer to the papers Bigot and Gadat [5], Bigot, Gadat, and Loubes [6], 
Bigot, Gamboa, and Vimond [7], Castillo and Loubes [10], Dalalyan, Golubev, 
and Tsybakov [14], Dalalyan [13] and Gamboa, Loubes, and Maza [19] for re- 
sults on the estimation of different features of the curve registration model. 
The present work builds on Collier and Dalalyan [11], where a comprehensive 
overview can be found. 



Shifted curve model 

This paper deals with the shifted curve model, which we will state in a Gaussian 
sequence form, but which originally relates on two 27r-periodic functions / and 
/* in L2. Expanding these functions in the complex Fourier basis, we get 

+00 +00 
/W= E c,(/)e'^* and/*(t)= ^ c,(/V* for i e [0, 27r], 

where c,(/) = ^ /J^^ dt and c,(/*) = ^ C /*(0e-'^"* dt. 

With this notation, if / and /* only differ from each other by a shift, then 
the Fourier coefficients verify Cj{f*) = e^^'^Cj{f), for some real r in [0,27r] and 
all non-zero integers j. Hence, if we introduce the pseudo-distance d such that 

+00 

d2((ci, . . .), {cl.. .)) ^ inf 5] |c, - e-'^--c*|2, (1) 

and given that Cj{f) = c-j{f) for every integer j, testing that / was shifted 
from /* amounts to testing if d(c, c*) = where c = (ci(/), C2(/), . . .) and 

C* = (Ci(/*),C2(n,...). 

Now, if we assume that the observations are given by the white noise model 

dY{t) = f{t) dt + a dW{t) and dY*{t) = f{t) dt + a dW*{t), 

where a > and W, W* are independent Wiener processes, we can state our 
model in a more convenient Gaussian sequence form: 



I = + < 

where 



y# ^ „# , _f# ' i - 1:2,..., (2) 



• {O'^jii ~ 1;2,...} is a family of independent complex random variables, 
whose real and imaginary parts are independent standard Gaussian variables, 

• (T is assumed to be known. 
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Our problem amounts to testing Hq against Hi with 

H,: d(c,c*) = 0, 

Hi : d{c,c*) > Cp„, ^ ' 

where C is a positive constant and pa- is a sequence of positive real numbers. 
For reasons that we shall explain later, we assume that c and c* belong under 
the alternative to a Sobolev ball 

oo 

with s > 0. With this notation, we denote 8o and Qi the parameter sets corres- 
ponding to the hypotheses Ho and Hi, Y and Y* the sequences (Vi, Y2, . . .) and 
{Y*, Y*, . . .), and we call Pc,c* the probability engendered by {Y, Y*) when the 
parameters are c and c*. 

A detailed discussion of the model is deferred to Section 4, but before that, 
we point out that our choice of the Gaussian sequence model is not restrictive, 
since this model is equivalent in Le Cam's sense to many other models, including 
Gaussian white noise, density estimation (c/. Nussbaum [32]), nonparametric 
regression (c/. Brown and Low [8], in the case of random design in Reifi [34], 
in the case of nonGaussian noise in Grama and Nussbaum [22] and Grama and 
Nussbaum [23]), ergodic diffusion (c/. Dalalyan and Reifi [12]). On the other 
hand, the Gaussian noise is accepted in computer vision as a good approximation 
of the Poisson noise, that is more natural in this context. 



Minimax testing 

A randomized test in our model is a random variable taking values in [0, 1] and 
measurable with respect to the tr-algebra engendered by {Y,Y*). In practice, 
the user simulates an independent random variable with a Bernoulli distribution 
of parameter the value of the test, which was computed from the data {Y, Y*). 
The null hypothesis is accepted, respectively rejected, when the result of the 
simulation is or 1. We say that a test is nonrandomized when it only takes the 
values or 1. 

To measure the performance of a test ip, we choose the minimax point of 
view, in which the errors of first and second kind are defined by 

j Go) = supe^^ Ec,c« (V") , 
I /3(V',ei)=supe,E,,,.(l-7/;). 

Note that in the nonrandomized case, a(V',Oo) = supg P(.^c*{'4' = 1) and 
/3(^,ei) = supe,Pc,c«('/'-0). 

We say that consistent testing in the asymptotic minimax sense is possible if 
for all a, /3 > 0, there exists a test ipa such that 

lim a(ipa-, ©o) < ce, 

a-^ (6) 
lim/3(V',,ei) </3. 

cr— >0 
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The distance between the nuh and the alternative hypotheses, Cpa, determines 
the existence of such tests. Indeed, if Cpa is too small, no testing procedure is 
asymptotically better than a blind guess, for which a('0,So) + ©i) = 1- 
For a fixed pair a, /3, we call p* the asymptotic minimax separation rate if there 
are two positive constants C* and C* such that consistent testing is impossible 
for Per = Pa and C < C*, and possible for p„ — p* and C > C* . The best 
constants C* and C* satisfying these conditions are called exact separation 
constants. Conventionally, one applies the informal minimal writing length rule 
to avoid nonuniqueness of the minimax separation rate and of these constants. 
Moreover, a test which is consistent when p^ = p% and for some C > is called 
asymptotically minimax rate optimal. 

There is a vast literature on the subject of minimax testing: minimax sep- 
aration rates were investigated in many models, including the Gaussian white 
noise model, the regression model, the Gaussian sequence model and the prob- 
ability density model, for the greater part in signal detection, i.e., testing the 
hypothesis "/ = 0" against the alternative "||/|| > Cpa" . We present a selective 
overview of the papers that are the most relevant in the context of this work. 

Starting from Ingster [25], Ermakov [15] and Ermakov [16], where the mini- 
max separation rate and the exact separation constants were obtained when the 
functions in the alternative lie in ellipsoids and the separation from is mea- 
sured by the Z2-norm, various cases were considered: Zp-bodies as well as Sobolev, 
Holder and Besov classes. We refer to Ingster and Suslina [27] and Ingster [26] 
for a survey. The cases when the functions in the alternative set lie in Sobolev 
or Holder classes and the separation from is measured by the sup-norm or by 
their values at a fixed point were studied in Lepski and Tsybakov [30]. Finally, 
the case of the Lp-norm with p < 2 in Besov classes was considered in Lepski 
and Spokoiny [29]. 

Now, all the previously cited results are asymptotic, in the sense that the 
noise level a (in the white noise model) tends to 0. But from a practical point of 
view, it may be interesting to look at the problem from a nonasymptotic point 
of view. In the regression and Gaussian sequence models, Baraud [1] derived 
nonasymptotic minimax separation rates when the functions in the alternative 
lie in /p-bodies (0 < p < 2) and the separation from is measured by the I2- 
norm. Baraud, Huet, and Laurent [2, 3] proposed procedures for testing linear 
or convex hypotheses in the regression model, and Fromont and Levy-Leduc [18] 
inspected the improvement implied by a further hypothesis on the periodicity 
of the signal in the periodic Sobolev balls. 

Composite null hypothesis testing 

Up to here, we have reviewed results dealing mainly with a simple null hypoth- 
esis, namely in the case of signal detection: "/ = 0". In contrast, the testing 
problem in the shifted curve model deals with a composite null hypothesis. Here, 
we give a brief overview of the papers presenting hypothesis testing problems 
with composite null hypotheses. 
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The series of papers Baraud [1], Baraud, Huet, and Laurent [2, 3] tackled the 
case of a nonparametric null hypothesis, but their assumptions are not appli- 
cable in our set-up, since our null hypothesis as defined in (3) is neither linear 
nor convex. On the other hand, the test of a parametric model against a non- 
parametric one was studied in a substantial number of papers ( c/. Horowitz and 
Spokoiny [24] and references therein), but only in Horowitz and Spokoiny [24] 
from a minimax point of view. The minimax separation rate that they obtained 
is the same as with a simple null hypothesis. This is due to the strong assump- 
tions made on the behaviour of the estimator of the parameter characterizing 
the model under Hq. 

On a related note, Gayraud and Pouet [20, 21] treated a more general com- 
posite null hypothesis in the regression model, that is mainly characterized by 
its entropy. In fact, the set of functions in the null hypothesis can grow with the 
sample size, and so be nonparametric. Their rate is the same as in the case of 
a simple hypothesis. Finally, Butucea and Tribouley [9] also considers the case 
of a nonparametric null hypothesis, since Hq is "/ = £(" , where / and g are two 
density functions. 

Adaptive testing 

A limitation of the minimax approach is that the optimal tests depend on the 
smoothness class. This is not convenient from a practical point of view, because 
the choice of the smoothness seems to be unnatural and arbitrary. To obtain 
handier procedures, we need an adaptive definition for hypothesis testing. 

Prior to testing, some sets of smoothness parameters s, L must be chosen, 
over which adaptation is performed. Typically, these sets are taken as compact 
intervals [si, S2], [Li, L2\. To each couple of smoothness parameters (s, i), we as- 
sociate the smoothness set J-s.L^ and we write Oq and O^' the corresponding 
null and alternative hypotheses. Note that, in our problem, Og ^ = Oo is inde- 
pendent of the smoothness parameters, and that O^'^ depends on (s,L), not 
only because c and c* are in Ts,l, but also since p„ is allowed to be a function 
of s: as a matter of fact, 0^'^ depends on the choice of the radius Cpa{s). The 
easiest way to achieve adaptation is to use the test corresponding to the most 
constraining smoothness (51,^2) , but this entails a significant loss of efficiency 
if the tested parameters are in fact smoother. 

Thus, we prefer a more economical approach and we will say that consistent 
adaptive testing is possible uniformly over s S [si, S2] and L G [Li, ^2], if for all 
a, /3 > 0, there is a test ipa depending only on si, S2, Li, L2, a and (3 such that 




lim a{Tpa, Oq) < a, 
Ifi^sup/3(^<„e^^^) </3. 



(7) 



However, adaptive testing is not always possible without loss of efficiency, i.e., 
taking Pais) — Pa{s) for each s. That is why it was suggested in Spokoiny [37] 
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to replace a by crdo- in the expression of p*cr{s)^ where da is a sequence of positive 
real numbers, which can be seen as a necessary payment regarding the intensity 
of the noise to achieve adaptivity. 

Now, we say that p*^ {s),s € [si,S2] is the adaptive asymptotic minimax 
separation rate if there are two positive constants C* and C* such that adaptive 
consistent testing is impossible for Pa{s) — p^d i^) ^^d C < C^,, and possible 
for pa{s) = pld^ (s) and C > C* . 

Spokoiny [37] proved that the optimal asymptotic factor is (log log cr^^)^/'*, 
for signal detection in Besov balls. Gayraud and Pouet [21] extended this result 
for Holder classes in the regression model. 

Fan, Zhang, and Zhang [17] provided a generic tool to construct minimax 
and adaptive minimax tests: the generalized maximum likelihood, that we also 
use in the present work to build our procedures both in the nonadaptive and 
adaptive contexts. 

Our contribution 

The problem considered in the present work is qualitatively different from the 
aforementioned works on the minimax separation rate, since our null hypothesis 
is not only composite but also semiparametric. Furthermore, it seems that the 
finite-dimensional parameter cannot be uniformly consistently estimated, which 
contrasts with the situation of Horowitz and Spokoiny [24] . 

Nevertheless, we propose a testing procedure which is consistent when the 
separation rate is of order (cr^^logcr"^)^*/^^''"^. This rate is then proven to 
be minimax, up to a possible logarithmic factor. Indeed, no testing procedure 
is consistent for a separation rate smaller than (j4s/4s+i^ which is the rate of 
signal detection in the Gaussian sequence model when the signal to be detected 
belongs to a Sobolev ball and the separation from is measured by the Z2-norm. 

Further, an adaptive test is proposed to circumvent the limitations of the 
nonadaptive approach. This test is minimax rate optimal, up to a possible log- 
arithmic factor, uniformly over a family of Sobolev balls. 

Finally, there is a gap between our lower and upper bounds for the asymptotic 
minimax separation rate. It could be argued that the lower bound is suboptimal, 
and that the minimax separation rate for the shifted curve model does contain 
our logarithmic factor. Indeed, the problem of testing the goodness-of-fit of the 
shifted curve model can be regarded as an adaptation to the unknown shift pa- 
rameter. As a matter of fact, if adaptation to the unknown smoothness typically 
entails a loglog-factor, other types of adaptation can bring simple logarithmic 
ones: it is proved in Lepski and Tsybakov [30] that the asymptotic minimax 
separation rate for signal detection when the signal to be detected belongs to a 
Sobolev or Holder ball and the separation from is measured by the sup-norm 
is (CT^-\/log(F^)*/^''+^, while it is 0-2s/2s+i -^^i^en the separation from is mea- 
sured by the value of the signal at a fixed point. The logarithmic factor can be 
interpreted as a payment for the adaptation of the problem of testing at one 
point when this point is unknown. Furthermore, note that the same logarith- 
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mic factor appears in Fromont and Levy-Leduc [18], where upper bounds on 
the minimax separation rate are estabhshed in the problem of periodic signal 
detection with unknown period. 



Organization of the paper 

The rest of this paper is organized as follows: a nonadaptive procedure is pro- 
posed in Section 1, and adjusted in Section 2 to obtain an adaptive test. We 
also state their minimax performances, which Section 3 indicates to be at least 
nearly optimal in the minimax sense. The theorems are proved in Sections 5 
to 7, and the lemmas used in their proofs are presented in Section 8. The model 
is discussed in Section 4. 



1. Nonadaptive testing procedure 

Here, we build a test which will be proven later to be minimax, up to a possible 
logarithmic factor. Indeed, the procedure achieves the rate (CT^\/loga^)^'*/'*'*+^. 

Our proposal, which carries on the work presented in Collier and Dalalyan 
[11], is based on standardized versions Xa{N) of estimators of d{c, c*): 



i'aiN,q)= l{A„(7V)>g}, 



(8) 



for N G W and q e M. Put into words, the test ipa-{N,q) rejects the null 
hypothesis when the statistic Xa{N) exceeds the threshold q and accepts it 
otherwise. The following theorem establishes the minimax properties of this 
testing procedure for a proper choice of the tuning parameters. 



Theorem 1. Set 

eo = {{c,c*)ehxl2\d{c,c')^oY 

&i = {(c,c*) e T,,L X J-,,L I d(c, c*) >Cp^y 



(9) 



with s and L are positive real numbers, pa — [a^\/log a and > 

4L2c-2^ + sj^^, Csx = (4sL2V4iTT)^/'"'+^ Denote the test iJa{N,q) 

defined in (8) with N — N^{s,L) — [cs^l Pa^^'^] and q = qa, the quantile of 
order 1 — a of the standard Gaussian distribution. Then 

Ii^a(V'„,eo) < a, (10) 

(J— >-0 

lim/?(^,,ei) =0. (11) 

Remark 1. In the rest of this section and in the proof, we skip the dependence 
of Ni^(s, L) in s and L when no confusion is possible. 

The proof of this result is given in Section 5. Let us now develop a brief 
heuristic describing how one could have guessed the optimal value of p^. 
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Heuristic for the performance of the nonadaptive procedure 

Our proof will show that, under Hq, Xa-{Na-) is bounded from above in proba- 
bility. Thus, we decide to reject the null hypothesis when X„{Ncr) is larger than 
a constant to be chosen properly. 

On the other hand, we inspect the behaviour of the statistic under the alter- 
native hypothesis and give a condition on pa- under which the test statistic is 
orders of magnitude larger than a constant, so that the procedure can have the 
desired power. 

We derive the lower bound 



A. 

j=i i=i 



:max^Re(e'^^Cj(* 



i=i 



negligible terms. 



2VN^ 

The proof will establish that the second term is bounded in probability, while 
the third, that we call perturbative, is of order ^/logN^. The first term, up 
to a 4:\/Na-cr'^ factor, is an approximation of the square of the pseudo-distance 
rf(c, c*). Since c and c* lie in Jvs,l, the remainder of the sum can be bounded 
from above, up to a constant factor, by N^'^'' . In a nutshell, we get the heuristical 
lower bound 



KiN^) > Cste ■ ^ ' /ir^ p Op (Vlog^) 

Consequently, the alternative is detected as soon as 



pI » max N'^' , ^N, log N„ ^ a^v/log^ 



r-l 



Heuristic for the constant C 



We may now ask how small the constant C can be without making our testing 
procedure inefficient. This constant is only optimized for our test, and we do 
not claim it to be optimal in the minimax sense. 

The previous optimization shows that the test achieves its best rate when N„ 
is of the order of p*"^^". Now, denoting = [cp*~^^^], a similar heuristic can 
give an optimized constant C in the definition of Oi. Indeed, Lemma 6 gives 
the more precise lower bound (C^ - AL^c-^'')pl^ for the sum in the first term, 

and we will prove the exact order of magnitude of the third to be ^J^^^\ogNl. 
Thus 

and this leads to a minimization problem determining the choice of c (c/. The- 
orem 1). 
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2. Adaptive testing procedure 

The procedure given in the previous section possesses asymptotic minimax op- 
timality properties thanks to an appropriate choice of the tuning parameter N^, 
but the practician needs to determine values of s and L to implement the test. 
As it seems arbitrary and nonintuitive to make assumptions on the smoothness 
of the signals, it is necessary to design testing procedures independent of s and 
L that are nearly as good, in the minimax sense, as the procedure proposed in 
the previous section. 

In this section, we only assume that an interval [si,S2] is available such 
that c, c* e J^s,L for some s S [si,S2] and L £ [0,+oo[. We propose a testing 
procedure depending on si and S2 but independent of s and L, that achieves 
the same rate of separation, i.e., (cr^ -\/log ^-i j ^ g^g ^j^^ ^^g^ based on 
the precise knowledge of s and L. Furthermore, this rate is achieved uniformly 
over the Sobolev classes J^s,l with s £ [si, S2] and L belonging to any compact 
interval included in R"*". 

Here is the idea of its construction. The nonadaptive testing procedure pro- 
posed above depends on s only via the tuning parameter Na{s,L). In the fol- 
lowings, we will change the definition of N^{s,L) to avoid the dependence on 
L and we will write N^(s). Using a Bonferroni procedure like in Gayraud and 
Pouet [21] or Horowitz and Spokoiny [24], we consider the maximum of these 
tests for several values of Na{s), more precisely, we consider tests of the form 
4'criQ) = '^sjiN^j^fipa-iN^q). For this kind of test, the next proposition gives 
bounds for the first and second type errors: 

Proposition 1. Let M be a set of positive integers and denote ijjcriq) the test 
va.&'x.M^j\f ^a{N,q), where ipa is defined in 8, then 



Consequently, the set M has to be as small as possible (to control the first 
kind error), but rich enough to approximate the set of all Na{s) for s £ [si, 52]- 
We will show in the proof that each N £ M brings adaptation over all Sobolev 
balls of regularity s such that there is a 5 such that N = Na{S) and S < s < 
S + 1/logcr^^. Hence, we introduce the following notation leading to a proper 
choice of N. 

For every S2 > > 0, define 



/3(^,(<z),e^i'^) < minA,e^/3(Va(^,g),e^^^). 




(13) 



Theorem 2. Set 




(14) 



with C > 0, p„{s) = Cpl{s), p*{s) = {a^y^loga 



2s 
4s + 1 
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Consider the test = max7veA/'((Ti,(T2) V'o- (-^, \/2 log log cr^^^ , where is 
defined in (8). Then, for the interval [si, S2\ used in the construction of the test 
and for any interval [Li, L2] included in there is a constant C such that 

lim afe,eo) = 0, (15) 

cr— >0 

lim sup sup /3(V'o., 6i'^) =0, (16) 

Remark 2. In the statement of this theorem, one observes that the constants Li 
and L2 are not used in the definition of the test, while L was, in the definition 
of the nonadaptive procedure. Indeed, we optimized the separation constant C 
and gave an expression depending on L, while this optimization was not our 
matter in the second theorem. 

Remark 3. The theorem claims that there exists a value of C for which the first 
and second type errors can be controlled. From the proof of the theorem, we see 
that it is sufficient that such a constant satisfies 

J C2 - 4L|e<*=i+i>" - § > 



C > 



64 



V4S1+1 

which is verified when C > max (-^^Si=, i + (-L + 4L^e"(^^I+i)^)^/2) 



Heuristic for the performance of the adaptive procedure 

Here we explain why our adaptive procedure achieves the same rate as the 
nonadaptive one. The heuristic of the previous section roughly holds, with this 
difference that maxjv Xa{N) is of loglog-order under the null hypothesis. But this 
term is negligible in view of the perturbative term, so that the performances of 
the test do not deteriorate in the adaptive problem. 



3. Lower bound for the minimax rate 



After stating the performance of our tests, we prove in this section that they 
are at least nearly rate optimal. Indeed, we are able to establish a lower bound 
for our model, by proving that the detection of a signal lying in a Sobolev ball 
when the separation from is measured by the /2-iiorm (c/. (17) for a precise 
definition) is simpler than ours, in the sense that every lower bound result for 
this model is adaptable for our purpose. 

Let us first introduce the classical signal detection problem, for which the 
minimax separation rate, and even the exact separation constants, arc known: 

Y,=Cj+o-^j, j = l,2,..., 
< Qf^^^ = {0}, (17) 
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For this model, we define the errors of first and second fcind of a test ^'^^'^^^ by 

^, class /^/, class /2\class\ 1? //,/iClass\ 

(18) 

^classj-^class^ Qcl^s-^ ^ SUpQclass Ec(l - ^'^''"^^) , 

where we denote Pc the probabiUty engendered hy Y = {Yi,Y2,...) when 
(C1,C2, ...) =c. 

Theorem 3. Given the two models exposed in (2) and (17), we have 

inf/3(V'a,ei)> inf /3'^•'^^^(C'"'^ef'^^^), (19) 

where the infima are taken over all tests of level a respectively for our model 
and for the classical one. 

Thus, our model can benefit from every lower bound result on model (17). We 
choose to exploit the nonasymptotic results presented in Baraud [1], Proposition 
3. The following theorem shows that the asymptotic minimax separation rate 
for our problem is not smaller than (j4s/4s+i_ 

Corollary. Let a and (3 he in ]0, 1]. Define rj = 2(1 — a — (3), C = log(l + r?^) 
and p2 = suprf>i [V^Zda'^ A L^j^-asj ^ y^^g^ 

Pa<P => inf/3(V'",ei) >/3, (20) 

where the infimum is taken over all tests of level a for the shifted curve model. 
Remark 4. We can approximate p by computing 

sup [V2£^CT^ A i^x"^"] = L35TT(CT2y2£)^. 

aneR+ 

Remark 5. Our proof shows that every lower bound result for adaptive testing 
could be used for our purpose as well, for instance Gayraud and Pouet [21]. 



4. Discussion 
Model 

The choice of our model was inspired by practical considerations, and we intend 
to apply it to a problem in computer vision: that of keypoint matching as briefly 
discussed in Collier and Dalalyan [11]. Accordingly, it is necessary to justify the 
realism of model (2). 

Variance 

Although the theoretical analysis of this paper is carried out for the Gaussian 
sequence model, the procedure we propose admits a simple counterpart in the 
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regression model, at least in the case of deterministic equidistant design. Ac- 
cording to the theory on the asymptotic equivalence, our results hold true for 
this model as well, provided that s > 1/2 (c/. Rohde [36]). However, in the 
model of regression, it is not realistic to assume that the variance of noise is 
known in advance. 

Nevertheless, one can compute a consistent estimator of the variance (c/. Rice 
[35]) and plug this estimator in the testing procedure. In an analogous setup, 
it is proved in Gayraud and Pouet [20] for example, that this plug-in strategy 
preserves the rate-optimality of the testing procedure. We believe that a similar 
result can be deduced in our set-up as well. 

Symmetry of the model 

In our modelization, the two parts corresponding in the Gaussian white noise 
model to two different functions are treated symmetrically: the same model, with 
the same variance and the same noise, applies to both. But, in applications, the 
signals that we want to match with each other are thought to have the same 
nature. In addition, it seems that it is not meaningful to consider the case when 
the regularities of the Sobolev balls are different for the signals: under Hq, the 
regularity has to be the same. 

Besides, one could want to normalize both equations to get the same variance 
for both sides. But, this would also change the functions, which would not 
only differ from each other by a shift, but also by a dilatation. Therefore, the 
application of our methodology to this case is not straightforward. However, 
a detailed inspection reveals that our results carry over to the case when we 
replace a by max((T, ct*). 

Weighted estimator 

In Collier and Dalalyan [11], another estimator of cP{c,c*) is used, stemming 
from a penalization of the log-likelihood ratio. This could be adapted in our 
context by considering the test statistic 



where w = {wi,W2, . . .) is a sequence of real numbers in [0, 1] depending on a. 
Under some conditions on w, our study would undergo only few modifications, 
and only the optimal constants would be changed. For simplicity sake, we chose 
not to consider the weighted estimator. 

From classical signal detection to shift testing 

A first guess to try solving our problem could be to use an estimator f of 
the shift and to apply the classical signal detection methods to the sequence 




(21) 
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(Yj — e'^'^Yj*). But this approach fails, since it is not possible to get any consis- 
tent estimator of the shift. Indeed, for example, the shift may not be identifiable. 
Consequently, the study of the perturbative term (c/. first heuristic after The- 
orem 1) is unavoidable, in order to take into account every possible shift. We 
think that this uncertainty entails a price, i.e., a supplementary factor in the 
minimax separation rate. 

Future research 

Our model is only a simple version of the curve registration problem. In further 
work, we could study what happens when the signals are shifted and dilated by 
considering the pseudo-distance 

+ CXD 

d'(c,c*) = inf y |c, - ae'J'"c*p. 

r,a ^ — ^ 

J = l 

Once again, the problem is whether it is possible to estimate the dilatation 
parameter consistently. 



5. Proof of Theorem 1 
5.1. First kind error 

Here, we prove that the asymptotic first kind error of the test ipa- does not exceed 
the prescribed level a. To this end, denote r* a real number such that, under 
Ho, Vj > 1, c* = e'^'^'cj. We skip the dependence of r* on c and c*. Using the 
inequality 

JV„ N„ 7V„ 

minj: |y, - e--^^^Y*\' < J] \Y, - e~-^^^'Y*\' = ^ |e, - (^f" , 

i=i j=i j=i 

we get 



a(V'., Oo) - supPe,c« ( , , ^ minV \Y, - e-^^y/f - ^ > q^) 



where tj, = Re(^, - e-'^'^*^*), % = lm(0 - e''^^' 1' AA(0, 2). 
Finally, using Berry- Esseen's inequality (c/. Theorem 5), we get 

a{ipcr, 9o) < a + 



and this gives the desired asymptotic level. 
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5.2. Second kind error 



It remains to study the second kind error of the test, and to show that it tends 
to 0. Our proof is based on the heuristic given earher in Section 1: we decompose 
Acr(iVcr) into several terms, and make use of their respective orders of magnitude. 
The decomposition gives 



> mm { ^ |c, - e-'^-c*|2 + 2a ^ Re ((c, 



^C*))} (22) 



i=i 



For simphcity sake, we introduce some notation: 

' D,{c, c*) = min, { ^fj, |c, ~ e-^^c'^ 



+ 2aE,JiRe((c, 

I 2 I I t# |2 



_Bct — max^ 



which, combined with (22), leads to: 

Pii^a, ei) < supPc.c* (d,{c,c*) - CT^^/^A, - 2a^B, < Aq^a^^N^ 

In addition to Cs.l, introduced in the definition of we will need the constant 
c' and e, defined as 



256 Ca, I, 
4s+i ' 



i (C^ - AL^c-f 



256 Cs.l ' 

45 + 1 , 



Separating the different terms to study them independently, we write 

<SnpPc.c'(Da{c,C*) < (c' + £ + 



ei 



--)pI 



v/iogo^ 



+ P l^aW^aA^ > eplj + P i^2<j^B, > c! p, 
• Let us first study supe, Pc c« (^a(c, c*) < (c' + e + ^^S^^i)p^), which con- 

-y/log(T-l 

tains the dominant term when p^ is too large. 
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Denoting S ~ — '^L^c^ Lemma 1 allows to apply Lemma 2 with xq 



Spa- and M ~ (c' + e + xhe choice of the parameters yields for a 



small enough 



,6 c' + e QaJc^ , 



so that the second part of Lemma 2 holds 
supPc,c«( £'<t(c,c*) < (c' + e 



viogffi 

< 2 (^1 + r^Lp^i max{l, Ar^-'^}^ 



exp 



cr-yO 



0, since /Ocr/cr ^ as cr — >■ 0. 



• Let us now turn to P (ct^ v^iV^^cr > ^Pa)- Prior to using Berry-Esseen's 

2 

inequality (c/. Theorem 5), we derive ^Ji^Jj^ > 4Vca,L v ^°S^~"^' that, 
putting a: = ^ i — -y/loga^^ into the formula of the theorem and using the 



bound 1 — < ^^^^^ for every positive x, 



32CsL cr32c 



^ 0. 



TriVn- ' V Tre^ ^logo— i 
• Finally, it remains to control P(2<t^ B^r > c'p'^). We apply Lemma 3: 



p(2o-2b„ > c'pI] < 2c(logCT"^)^cr^"34 



+1 + e 



< 2c(logCT"i)=TT + e-^-/2 ~> 0. 



6. Proof of Theorem 2 
6.1. Proposition 1 

Let A/" be a set of positive integers and denote tpaiq) — la&xpf^j^ V'o-(-^, <?), where 
tpa is defined in 8. 

• Concerning the first kind error: 

N 

a{^l>^{q),Qa) = sup Pc.c« f max min V |y, - e^'J'^Y,*^ > 
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N 



< y sup Pc,c« f min y\Yj- e-'^^Y*\^ > q) 



• Concerning the second kind error: 

N 



P{^„{q),Q'{'') = sup P,,,. (max min ^ \Y, - g-'^'^F/p < q 



N 



< sup Pc,c* ( |Y, - e-y^r/p < , ViV € A/", 



< min/3(V',(iV,(j),e^i 



6.2. First kind error 

Here, we prove that the first kind error of the test -00- converges to 0. To this 
end, denote r* a real number such that, under iJo, Vj > 1, c* = e^^'^' Cj. We 
skip the dependence of r* on c and c*. Using the inequahty 

Af„ N„ 

2 ^ X ^ I , ^ — iiV* ^tt I 2 9 X ^ I ^ — iiV* 1 2 



j=i i=i j=i 

we get 

«(^.,eo)< ^ P 77T7 E(^l + - 4) > v/ 2 log loga-i , 

where 77^- = Re(Cj - 6"^^'^*),% = Im(^j - e-'^'^'^*) ''^'^ A/'(0, 2). 

Thus, using Berry-Esseen's inequality (c/. Theorem 5 with x = \/2 log log ti^^) 
and the bound 1 — <^(x) < for every positive x, 



^ V4-logloga-^ / 

1 Card7V(si,S2) 1 Card7V(si, §2) 



27r v^7V^(s2) \/4^1ogcr-Vloglogo-i' 

As CardA/'(si, S2) = 1 + [ (s2 — si) loga^^] is of logarithmic order, this implies 
that a(ijja,Qo) -> 0. 
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6.3. Second kind error 



Finally, we study the second kind error and prove that it converges to 0. 

For s G [si,S2], define S — max G S(si,S2) | t < s}, where we omit 
the dependence of in s for simplicity sake. Note that < s — S < j^jj^- S 
is an approximation of s which will be sufficient for our purpose according to 
Lemma 6. 

We introduce the notation 



+ 2aEf=f ^ Re ((c, - e-'^--c*)(0-e-im))}, 
At 



Bt = max^ 



ES^)Re(e'^-^^,^|) 



and computations similar to those of the previous section yield 

sup sup ^(?/i„,ej''^) 

[Ll,L2] [si,S2] 



c 



< sup sup Pc,c* (i^^ (c, c*) < aV32iV,(5) logloga-i + 



+ a^VA^^: > 2-'^^ > TpK^ 



• Let us study sup^ ^ supg^.,!- Pc,c» (-Df (c, c*) < cr^ y/ 32 Na{S) \og\og a ^ 



Lemma 6 implies 



so that, denoting 5^ — C"^ ~ 4i^e^^^i+^, Lemma 1 allows to apply Lemma 2 
with xo = 5pl{s) and M = 0-2^32 7V<^(S') log log cr-i + %pl{s). On the other 

hand, the choice of 5 entails that for C large and a small enough 



w ^ ^\*(^ ^V2A^.(5)logloga-i ^ ^ 

Hence, applying the second part of Lemma 5, we get an inequality where the 
right-hand side converges to as a tends to 0: 

sup sup Pc,c« (d^, (c, c*) < (7^ V32iV,(5) log log a- 1 + ^pl{S) 

S 0=.!. V 2 

< 2 f 1 + pa{s2)-^ max{l, N„{s^f-'^} 
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exp{-((5^-^)p^(.0 



V32 7V,(si)logloga-i)V32(5V'(si)fT2} 



exp 



I 8a2 / 



• Consider the second term. Berry-Esseen's theorem (c/. Theorem 5) imphes 
the following inequality, where the right-hand side converges to as cr tends 
to 0: 

C 



< CardA/'(si,S2) 



128 oris 



1TNa{s2) V TtC yiogO- 



• Let us turn to the third term. We apply Lemma 3 and get an inequality where 
once again the right-hand side converges to as cr tends to 0: 

C 



< CardA/'(si,S2) • 2(logcr"^) *-2+i crio24 4.1+1 _^g- 



7. Proof of Theorem 3 



Consider a randomized test ip in the shifted curve model. We will define a 
corresponding test in the classical model with smaller first and second kind 
errors, and it is sufficient to establish the result. 

First note that there is a measurable function / with respect to the cr-algebra 
engendered by the sequences Y and Y* and with values in [0, 1] such that -0 = 
/(y, y*). Denoting e a sequence of i.i.d random variables A/'(0, a^) independent 
from y, we define ip'^^^^^ = Ee(/(y, e)|y) , where Eg is the integration with 
respect to the probability engendered by e. ij)'^^'^^^ is cr(Y')-measurable and thus 
constitutes a test for the classical model. 

This testing procedure can be interpreted as a test in the shifted curve model 
when c* — 0. Indeed, d{c, c*) = ||c||2 when c* = 0, so that Q^"^^^ x C 9o and 
Qciass X C 61. By ToneUi-Fubini's theorem, satisfies 

^class^y^class Qclass) ^ SUp 

= sup Ee,o(/(y,y*)) 

< a(0,eo). 

A similar inequality holds concerning the second kind error. 
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8. Lemmas 

Lemma 1. Let c = (ci, C2, . . .) and c = (ci, C2, . . .) in -Fg.L, with s > 0, be such 
that d{c,c) > Cp, and let N + 1 > cp-^^\ Then 



N 



nin^ |cj - e-*^"cjf > (C" - A.L^c 



2„-2s\^2 



Proof of Lemma 1. Since both c and c belong to the Sobolev bah, it holds that 



j>N j>N 

<2(iV+l)-2^^j2«(| 



c.,l^ + l<5.,n 



< 4:L^{N + iy 

Consequently, taking into account that J^'jLi l^j ~ e^'-''^Cjp > o?^(c, c) > C^p^, 
we get 



N 



C7 C C-j 



> C^p'^ -AL^iN 
and the result follows in view 

Lemma 2. Let &e some positive integer, let j = 1,...,N be inde- 

pendent complex valued random variables such that their real and imaginary 
parts are independent standard Gaussian variables, and let c — (ci, . . . , cat), 
c = (ci, . . . , cjv) be complex vectors. Denote $, — (fi, . . . , ^jv), ^ = (^1, . . . , ^jv) 
and 

Da,N{c, c) = miur I J2f=i |cj - e"'-''^CjP 
+2<j Ef.i Re ((c,-e-'^-5,-)fe--e-»H-))}. 



^ WAr(C,C,c) = SUp^ 2^^-^;^ 



djv,T(c,e) 



j4sstime that xq < min-r o?Ar.r(c, c), t/ie^ 



VMeM, P i:'<,,jv(c,c) < M 



<2P(au^(|,c,c)> ^-^"j +2P('|^ <aii^(|,c,c)' 
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Assume further that c and c are in J-g^L o,nd that ^ — > 0, then combining 
the last result with Lemma 5, 



P\^Da,N{c,c) < Mj < 21^1 + L m(ix{l, N^-^'y^ 

X (exp{ - {xl- Mf/32xla^} + exp { -s^/Scr^}). 

Proof of Lemma 2. 



N N 



N 



o , , , , sr^^^l^iic-j — e~^i'^C.i)] 



2crc;Ar,^ C, c) > — 

^ dN,T(C,c) 



N 



> d%,ic, C) - 2CTdw,r C, C sup V ' , ''^ 

- 1^ dN^r(C,c} 



- 2crdjv,r(c, C) SUp > ^— - 

With the notation un{^, c, c) = sup^ X^jLi ^^dw^(lc) ''''' 

Da,N{c,c) > min(a;^ — ax), 



, we obtain 



with a = 2auN{^, c, c)+2cru7v(C c, c). Now, using the fact that min2;>2:o(a;^— ax) 
is reached at the point xg if xq > f , we get 



P D^^n{c, c)<M] < P x^ - 2xoauN{t c, c) - 2xo(7itAr(^, c, c) < M 



P Xq < <TUn{^, C, c) + CrMAr(^, C, c)) 



<2P[auN{^,c,c)>^-^^ 



since MAr(^, c, c) and un{^,c,c) have the same distribution. □ 
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Lemma 3. Let be independent complex valued random variables such that 

their real and imaginary parts are independent standard Gaussian variables, let 
c, s and a be some positive real numbers. Denote 



Ba = max,- 



Then, for a small enough and for every positive c' , 

Proof of Lemma 3. Applying Lemma 4, we state that, for a small enough, 



from which follows that 



We conclude, observing that ixpa-^^^^ y/clog{(j ^) = ^^2°/^ ■ □ 

Lemma 4. Let N be some positive integer and let ^j, ^j, j — l,...,N, be 
independent complex valued random variables such that their real and imaginary 
parts are independent standard Gaussian variables. Let u — {ui, . . . ,un) be a 
vector of real numbers. Denote S{t) = X^^i R-^ (e*-'*^j^j) for every t in [0, 2tt] 
and \\S\\oo = supjg[o,27r] \S{t)\. Then 

\fx, y > 0, P(||5||oo > V2x{\\u\\2 + yllwlloo)) < (iV + l)e-^'/2 ^ ^-y-/2^ 

Proof of Lemma 4- We refer to Collier and Dalalyan [11], Lemma 3, for a proof 
of this lemma. □ 

Lemma 5. Let c — (ci, C2, . . .) and c = (ci, £2, . . .) in J-s,l with s > and let 
N be an integer. Denoting r]j,fij A/'(0, 1), we define 

^ ^ rjj Rejcj - e-'J*Cj) + fjj Im(cj- - C'^Cj) 
J=i yz2j=i \c3-e '^'cj\ 

for every t in [0,27r]. Then 

\ / L-max{l,iVi-n \ 
p(||5|U>.t) < ( , ^ ^ ' ^ +l)e~^. 

min^X]j=i - e ^-J^c^p 
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First recall Berman's formula, that we will need in the proof. 

Theorem 4 (Bcrman [4]). Let N he a positive integer, a < b some real numbers 
and gj , j = 1, . . . , N be continuously differ entiahle functions on [a, b] satisfying 
9ji^)^ — 1 /o*" all t £ R and rjj, j = 1, . . . , N , some independent standard 
Gaussian variables. Then 

\ / .2 — 



with 

N 1 1/2 

I ' ' 



dt. 



Proof of Lemma 5. Denote 

R,g(c 




lm.{cj — e~^^*Cj) 



We compute the derivatives of these functions: 

Im(je^'-'*Cj) 



N 



^Im(fccfeCfee 



and .g (i) = =ii 



Im(cj - e-'J*Cj) 



|2 



whence 

N x^N ■2\~\2 



N 

^lm{kckCke~'''*), 



i2 max{l,iV2-2.i. 



- iV 

™int Efe=i |cfe - e-"=*Cfe|2 
The conclusion follows from Berman's formula. □ 
Lemma 6. Let a be a positive real number and s,S in [si,S2] C R+ be such 

2 s 

that < s — S < ■ Denote p*„{s) = (cr^-\/logcr^i) *°+^ , then, for a small 

enough, 

p*(S) ^ 
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Proof of Lemma 6. By the definition of pj(s), we fiave 




= (aVlog('T-i)) 



wfiicli, wlien a is so small that a^^Jloga ^ < 1, leads, with the hypothesis on 
s and S, 




< (aVlog(^-i)) 




Then, we compute 




4 



< g(4si+l)^ ^ 



and this concludes the proof. 



□ 



Finally, we recall here Berry-Esseen's inequality, in a simpler version than 
Theorem 5.4 of Petrov [33]. 

Theorem 5 (Berry-Esseen's inequality). Let N be a positive integer and some 
random variables Xi, . . . , Xn X and such that E(X) = 0, Var(X) — 7^, 
E|Xp = < +00. Denote Fn{x) = LjLi < ^) and $ the distri- 

bution function of the standard Gaussian variable. Then 



for an absolute constant number A. Moreover, in the case when X — — 1 and 
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